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Abstract 

C/3 ' The main objective of this two part paper is to formulate and address the problem of distributed 

uplink resource allocation in multi-carrier cognitive radio networks (CRN) with multiple Access Points 
. (APs). When the APs operate on non-overlapping spectrum bands, such problem is essentially a joint 

> 

If^ ■ spectrum decision and spectrum sharing problem. In this network, the cognitive users (CUs) are endowed 



with greater flexibility than the single AP network we considered in the first part of the paper ||2]: they 
can optimize their uplink transmission rates by means of: 1) association to a suitable AP and 2) sharing 

(N : 

, the set of channels that belong to this AP with other CUs associated with this AP. Clearly these two steps 

are interdependent, and our objective is to devise suitable algorithms by which the CUs can perform 
these two steps in a distributed and efficient fashion. 
I In the first part of this paper, we have studied solely the spectrum sharing aspect of the above problem, 

, and proposed algorithms for the CUs in the single AP network to efficiently share the spectrum. In this 

second part of the paper, we build upon our previous understanding of the single AP network, and 
formulate the joint spectrum decision and spectrum sharing problem in a multiple AP network into 
a non-cooperative game, in which the feasible strategy of a player contains a discrete variable (the 
AP/spectrum decision) and a continuous vector (the power allocation among multiple channels). The 
structure of the game is hence very different from most non-cooperative spectrum management game 
proposed in the literature. We provide characterization of the Nash Equilibrium (NE) of this game, and 
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present a set of novel algorithms that allow the CUs to distributively and efficiently select the suitable 
AP and share the channels with other CUs. Finally, we study the properties of the proposed algorithms 
as well as their performance via extensive simulations. 

I. Introduction 

A. Motivation and Related Work 

The objective of this two part paper is to provide the analytical framework as well as the solutions 
to the joint AP selection and power allocation problem in a CRN in the presence of multiple APs. As 
mentioned in the first part of the paper, the need for such joint optimization may arise in a CRN with 
multiple CUs and multiple APs, for example, the IEEE 802.22 cognitive radio Wireless Regional Area 
Network (WRAN) lH. In such network, a particular geographical region may be served by multiple 
service providers (SPs), or by multiple APs installed by a single SP |j4|. Consequently, the CUs, on top 
of being able to share the spectrum offered by a particular SP/AP, also have the flexibility of deciding on 
their SP/AP association. As suggested in [51| and fSl, it would be generally beneficial (in terms of either 
system-wide or individual performance), compared with traditional closest AP assignment strategy, to 
allow the users in multiple AP networks to include the AP association as an additional decision variable. 

In this part of the paper, we consider a CRN with multiple CUs and APs. The available spectrum 
is partitioned by the APs, and they operate on non-overlapping spectrum bands. Each CU's objective 
is to connect to a single AP for communication. The CUs can concurrently use all the channels that 
belong to its associated AP for transmission if desired, but different CUs interfere with each other if 
they use the same channel. In the considered network, the CUs first need to select an appropriate AP for 
communication, a task that can be viewed as a spectrum decision task because the CUs are essentially 
choosing a "best" spectrum band in terms of transmission rate. Then they face the spectrum sharing 
problem when they try to dynamically allocate their communication power across the channels that 
belong to the selected AP. Clearly for a fixed system wide CU-AP association, the entire network is 
reduced to multiple single AP sub-networks, and the (near-) optimal spectrum sharing solution for each 
of the sub-network is studied in the first part of this paper ||2l. Although as we have demonstrated that for 
each sub-network, our proposed spectrum sharing scheme has the potential of maximizing the achievable 
sum rate, in a multiple AP network, the system performance is inevitably tied to the quality of the system 
wide CU-AP association decision as well. A bad CU-AP association decision will result in unsatisfactory 
system performance regardless of the underlying sharing scheme. Consequently, the association problem 
and the sharing problem are strongly interdependent, and in this part of the paper, we intend to propose 
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distributed and efficient algorithm for the CUs in the network to carry out both of the tasks of AP 
association (spectrum decision) and power allocation (spectrum sharing). 

A related problem of joint cell selection/base station (BS) association and power control has been 
addressed in infrastructure-based cellular networks. [2] and (SI are early works trying to tackle this 
problem in an uplink spread spectrum cellular network. The objectives are to let the users find a best 
site selection and power allocation tuple such that all users' target signal to interference ratio (SIR) 
are met, and each user's transmission power is minimized. The authors of H] and IH cast a similar 
problem (with an objective to maximize individual power efficiency or minimizing individual cost) into 
game theoretical frameworks, and propose algorithms to find the Nash Equilibrium (NE) of the proposed 
games. One of the most important differences between our work and the above cited works is that the 
power allocation problems in these works are essentially scalar value optimization problem: each user 
only needs to decide on its power level once a BS is selected, while in our work, individual power 
allocation is a vector optimization problem as the CUs have the flexibility to use all the channels that 
belong to a particular AP concurrently. This fundamental difference makes the considered problem more 
complex, hence the analytical frameworks provided by the above cited works are not suitable for our 
problem. ||9] is a recent work using non-cooperative game theory to address the problem of distributed 
energy-efficient power control in uplink multi-carrier CDMA system. Similarly as in the above cited 
works, the solution proposed by the authors mandates that the users choose a single optimum channel 
as well as a scalar power level to transmit on the selected channel. ||4] is a recent work considering 
the uplink dynamic spectrum sharing problem in a multi-carrier multiple service provider CRN. The 
authors propose algorithms for the users to select the size of the spectrum and the amount of power for 
transmission. One important assumption of this work is that the users can connect to multiple APs at 
the same time (we refer to such network as multiple-connectivity network), an assumption that simplifies 
the analysis significantly but may induce considerable signaling overhead on the network side as well 
as hardware implementation complexity on the cognitive devicqj. Even such issues may be resolved 
in the future, our work, which analyzes the single-connectivity network, can serve as a benchmark for 
comparison between single-connectivity and multiple-connectivity networks. 

We also argue that the problem under consideration is in many aspects more complicated than the 
traditional AP association problems arise in the 802.11 WLAN network (for example, lITTI . llT2l and ||T3l 
and the reference therein). Typically, AP association is aiming to optimize different system performance 



'in WLAN literature, such network is also refeixed to as "multi-homing" network, see 1101 and the reference therein. 



metrics (throughput, fairness, etc), and only simple individual throughput estimates within each AP are 
used to update the current association profile. Indeed, in 802.11 WLAN network, the throughput of an 
individual AP with fixed number of users and fixed physical bit rate can be approximated using simple 
analytical formulae |[T4l . and this result has greatly simplified the analysis of many work dealing with 
dynamic AP association in WLAN, e.g., |[T3l and IfTOl . 

We note here that the problem of how to dynamically perform the task of both spectrum decision and 
spectrum sharing may arise in different CRN configuration as well. Many of the current works addressing 
the spectrum management problem in multi-channel multi-user CRN focus only on the spectrum sharing 
aspect of the problem. For example, in lITSl . |[T6l . IITtI . a set of iterative water-filling 1^ (IWF) based 
algorithms are proposed to find a distributed solution of power allocation in multi-channel, multi-user 
CRN. One important assumption underlying these works is that the CUs are able to use all the channels 
simultaneously. This assumption might not be valid in the situation where the available spectrum is 
fragmented due to licensed user activities and the CUs are equipped with 1 -agile radio which can only 
use a single chunk of continuously aligned channels at a time In this scenario, the CUs need to first 
decide on which chunk of channels to use, and then make subsequent power allocation decisions on the 
selected set of channels, i.e., the CUs are required to perform the task of joint spectrum decision and 
spectrum sharing. It is our belief that our work can also serve to shed some lights on providing solutions 
to the above problem, as the network configuration considered in our work is sufficiently similar to the 
configuration mentioned above. 



B. Contributions and Organization of This Work 

To the best of our knowledge, this is the first work that proposes distributed algorithms to deal with joint 
AP selection and power allocation problem in a multi-channel multi-AP CRN. We cast the problem into 
a non-cooperative game framework, in which each CU's objective is to maximize its own transmission 
rate, and its strategy space is the union of a discrete set (the set of possible APs) and a multi-dimensional 
continuous set (the set of feasible power vectors). Although non-cooperative game theory has recently 
been extensively applied to solve the resource allocation problem in CRN (e.g., |[T5l . lITTl . ll24l and the 
reference therein), our formulation is considerably different and more involved because of such "hybrid" 



^IWF is originally proposed in 1181 in the context of DSL network, and subsequently applied to wireless network with vector 
multiple-access channel 1191 and with gaussian interference channel in 1201 . 1211 1221 . 

^See 1231 for discussion of agile radios and the possibility of this scenario in actual CRN implementations. 
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nature of the strategy space of the game. We analyze in detail the equilibrium solution of the game, and 
develop a suite of algorithms with provable convergence guarantees that enable the CUs to distributedly 
compute the equilibrium solution. 

We organize our paper as follows. In section III we present the network model under consideration 
and formulate the problem into a non-cooperative game. In section Jill we analyze the properties of the 
equilibrium solution. In section JVl we provide our main algorithm and its convergence results. In section 
Ivl and |Vll we provide important extensions of the JASPA. We present simulation results in section IVIII 
and conclude the paper in section IVIIII 

II. Problem Formulation 

A. Considered Network and Some Assumptions 

We consider the following cognitive network configuration. Suppose there are a set {1, 2, • • • , N} = M 
of CUs, a set {1, 2, • • • , K} = /C of channels and a set {1, 2, • • • , VF} = W of APs in the network, 
and we normalize the total available bandwidth to 1. Each AP ^« G W is assigned with a subset of 
channels IC^ C /C. We focus on the uplink scenario where each CU wants to connect to one of the APs 
for transmission. The foUowings are our main assumptions of the network. 

A-1) Each CU i is able to associate to all the APs, and each AP covers entire area of the network. 
A-2) The APs covering the same area operate on non-overlapping portions of the available spectrum. 
A-3) The set of spectrum can be used exclusively by the CRN for a relative long period of time. 
A-4) Each CU can associate to a single AP at a time; it can concurrently use all the channels of the 
associated AP, if desired. 

A-5) Each AP is equipped with single-user receivers. Different APs in the network do not compete with 
each other for revenue. 

Assumption A-1) is made merely for ease of presentation, and our work can be extended to the 
scenarios where different APs cover different areas of the network, and where the CUs can only connect 
to the subset of APs that cover them. 

Assumption A-2) is commonly used when considering AP association problems in WLAN (e.g., ifTOl ). 
or the spectrum sharing problem in cognitive network with multiple service providers (e.g.,|0]). It is made 
to mitigate interference between neighboring APs. It can be achieved either by 1) the APs agree offline the 
partition of the spectrum or 2) the APs jointly run a distributed online spectrum assignment algorithm 

''in the presence of multiple SPs, such offline negotiation can be made possible by the coordination of a spectrum clearing 
house, as suggested in I?)- 
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similar to the ones proposed in lITTI to determine the best spectrum assignment. How to determine the 
"optimum" partition of the spectrum is out of the scope of this paper. Assumption A-1) and A-2) imply 
that )C^f])Cq = ^, y w^q,k,qeW. 

B. System Model 

Let {|/ii,«;(^)P}fceA:.„ be the set of power gains from CU i to AP w on all its channels; Let 
be the set of environmental noise powers on all channels for AP w, Let the x 1 vector a denote the 
association profile in the network, with its z*^ element a(z) = w indicating that CU i is associated to AP 
w. Each CU i is able to obtain its own channel gains to all the APs, {|/ii,to(^')P}fcGK:„,,«iGW> via feedback 
from the APs, but it does not need to have the knowledge of other CUs' channel gains in the network. 

Let pi^w{k) represent the amount of power CU i transmits on channel k when it is associated with AP w; 
Let pi^w = {pi,w{k)}j.^i^2 be the power profile of CU i when it is associated with AP w; let p_i,^ be the 
joint power profiles of all the CUs other than i that is associated with AP w: p^i^w — {Pj,w}j:jyti,a(j)=w 
By construction, for all w G W, if w ^ a(i), then pi^.^; = 0. The power profiles of the CUs must satisfy 
the following two constraints (as in ||2l): 1) Total power constraints; 2) Positivity constraints. As such, 
each CU's feasible power allocation when it is associated with AP w can be expressed as: 

J^i,w = |pi,t« : ^ Pi,wik) < Pi, Pi,w{k) > 0, V /c e /C^j. 

fce/c„, 

Again assume that there is no interference cancelation performed at the AP, then for a fixed AP association 
and power allocation configuration, CU i's uplink transmission rate (when it is associated with AP w) 
can be expressed as follows: 

-^j(Pi,iii) P— 

^J_V^, \Ku,{k)\'^Pi,y,{k) >, 

= kT.H'+ nuk)+m )=^-(p-.--i-.-;^) (2) 

where Ii{k) denotes the aggregated received transmission power level on channel k except CU i, i.e., 

-^i(^) - XI \^j,Uk)\^Pj,wik), h,w = {Ii{k)}k(zic^- (3) 

j:a(j)=w,j^i 

We note that, if w = a(i), then li^^ can be viewed as the set of interference currently experienced by 
CU i; if u; / a(i), li^w can be viewed as the set of interference that CU i would experience if it were 
to switch to AP w. 
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We see that ([T]) and Q are equivalent definitions of the CU i's transmission rate. We will use either 
definition in the following paragraph depending on the context. 

C. A Non-Cooperative Game Formulation 

We model each CU i as selfish agent with the objective to find strategy (tf*, p*^.) that maximizes its 
transmission rate, based on the current state of the network: 

(tt;*,p*^.) G argmax max i2i(pi^, p_j^; w). (4) 

We are now ready to define a non-cooperative game Q: 

Q = {M,{xi}^e^^.{R^}i^M} (5) 

where the CUs f G are the players in the game; each CU's strategy space can be expressed as 
Xi — Uujew ^^^^ CU's utility function is its transmission rate Ri{Tpi^w ^V-i^w'^w) as defined 

in ([U. We emphasize that each feasible strategy of a player contains a discrete variable and a continuous 
vector, which makes the game Q different from (and thus more complicated than) most of the games 
considered in the context of network resource allocation. We refer to the strategy space {xi}ieN of this 
game as hybrid strategy space. 

The NE of this game is defined as the tuple |a*(i), P*a*(i)| . ^ ^uch that the following set of equations 
are satisfied: 

(a*(i),p*a.(i) ) G argmax max i?i(pj,^, p^l^ ; u;), i G (6) 
or equivalently, V i G A/", w G W, pi^w € Ti^w, 

Note that pi • ^ is defined as the power profiles of all the CUs other than CU i that is associated with 
AP w in the NE: = {p* a«-(j)}jyi,a*(j)=«)- We call the equilibrium profile a* a NE association 

profile, and p*. = {p*a*(t)}iGA/' ^ power allocation profile. In order to avoid duplicated definitions, 
we call the tuple (a*,p*. ) a joint equilibrium profile (JEP) of the game Q (instead of a NE). It is clear 
from either of the above definition that in a JEP, the system is stable in the sense that no CU has the 
incentive to deviate from either its AP association or its power allocation. 

III. Properties of the JEP 

In this section, we introduce the notion of the potential function for the game Q, and its relationship 
with the JEP. This function plays an important role in our following analysis of the existence of JEP and 
the proof of convergence of the algorithm. We then prove that the JEP always exists for the game Q. 
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A. The Potential Function 

Consider a simpler problem in which the association vector a is predetermined and fixed. In this case, 
the CUs do not need to choose their AP associations, thus the problem of finding the JEP defined in Q 
reduces to the one of finding the NE power allocation profile p* that satisfies: 

P*,a(i) ^ max Ri{pi,p% ,-y,a{{}), V i S A/". (7) 

For a specific AP w, denote the set of CUs associated with it to be Mw'. Nw — {i '■ = w}. It is clear 
that {Mti;}wew is a partition of Af. We use p^ = {Pi.wjljgjv" denote the long vector containing the 
power profiles of all CUs associated with AP w. When a is fixed, the activity of the set of CUs Mw, 
w does not affect the activity of the set of CUs J\fq, q € W, q ^ w, because of the fact that AP w 
and q operate on different sets of channels. Consequently, the original game Q introduced in (jS) can be 
decomposed into W independent small games, with each small game defined as: 

Gw = Ww, {^i,w]i£M^ , {R^}^£^^^ } ■ (8) 

Clearly, each of such small game has the same form as the spectrum sharing game G analyzed in 
Section III of the first part of the paper. We define the potential function for the small game as well 
as for the original game G as follows. 

Definition 1: The potential function of the game under a feasible power profile Pw is defined as: 

Pw{Pw;ai) = ^ (log (j^wik) + ^ \hi,w{k)\^Pi,w{k)^ - logn.u,(/c) j . 

The system potential function under a specific a and a feasible p is defined as the sum of the potential 
functions associated to all games {Q^}w<^w' 

P(p;a) = P^(p^;a). (9) 

weyv 

Define = rTjeAT -^i,w as the joint feasible set for the CUs that are associated with AP w under 
the association profile a, and let = IlujeW-^S- Let £'«,(a) denote the set of all NE power profiles 
for the game If], then <S(a) = niuGW'^«'(^) power profiles for the game Q 

under fixed association profile a. Let pj^,(a) be any one of such NE power profiles for game Q^, i.e., 
p^(a) G <?«)(a); let p*(a) be any one o/ the NE power profiles of the network, p*(a) € <f(a). The 

'indeed, as argued in Section III and IV-A of (2j, tlie spectmm sharing game G (Iience the small game Q'^) may have a 
connected set of NE power profiles. 
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following corollary regarding to the relationship between p^(a), p*(a) and the potential functions is a 
straightforward consequence of Theorem 1 and Corollary 1 of |[2l. 

Corollary 1: For fixed a, a feasible £ J'w rnaxhnizes the potential fi^nction P^„(p^;a) if and only 
if it is in the set <£'u,(a). We define the maximum value of the potential function: 

Pw{a) = max P^(p„;a) = Pu;(p^(a);a) (10) 

as an equilibrium potential (EP) for AP w under association profile a. 

For a fixed a, a feasible p G that maximizes the system potential function P(p; a) if and only if 
it is in the set £{a). Similarly as above, we refer to the maximum value of the system potential function 
as the system equilibrium potential (SEP) under association profile a, and denoted it by P{a): 

P(a) ^ J] P^(a). (11) 

B. Existence of JEP 

In this section, we discuss the existence of the JEP as defined in We emphasize here that determining 
the existence of the JEP (which is a pure NE) for the game Q is by no means a trivial proposition. Due to 
the hybrid structure of the game Q, the standard results on the existence of pure NE of either continuous 
or discrete games can not be applied.Consequently, we have to explore the structure of the problem in 
proving the existence of JEP for the game Q. 

From Corollary [T]we see that a specific a can be mapped to a SEP, denoted by -P(a). We claim that 
any one of the AP association profiles a that maximizes the SEP, along with any one of its corresponding 
system power profile p*(a) G £(sl), constitute a JEP as defined in We state this observation in the 
following theorem. 

Theorem 1: The game G as defined in ([5]) always admits a JEP. An association profile a e arg maxa ^(a) 
along with any one of its corresponding NE power allocation profile p*(a)(j) = |p*a(-j)|. ^ G ^(S), 
constitute a JEP of the game G that satisfies 

Proof: We prove this theorem by contradiction. Suppose a maximizes the system potential, but a 
is not a NE association profile. Then there must exist a CU i who wants to switch from a(i) = u; to a 
different AP w ^ w. Define a new association profile a as: 

. 5(i) forj / i 
aO) = { ^ (12) 
w for j = i. 



10 



Let p*(a) G £'(a) and p*(a) G The maximum rate that CU i can get after switching to w if all 

other CUs do not change their actions: 

^t(p»,«;,Pffl(a); w) 



^ log ( 1 



, ( nsjk) + 

= -F'ffi(Pj,ffi,Ps(a);a) -Pffi(pl(a);a) (13) 

where /*(^) is defined similarly as in ©, and the vector pj ,j; is determined by: 

Pi,«; = arg max ^j(pj, p^(a); u;). (14) 

We can view the rate i?j(pj^^, pl(a); as CU i's estimate of the maximum rate it can get if it were 
to switch to AP w. 

Because CU i prefers w, from the definition of the JEP ^ we see that its current communication rate 
must be strictly less than its estimated maximum rate, i.e., the following must be true: 

Ri{vlw{^),V-i,ih{^)]w) < Riipi^s, pl{a);w) (15) 

where i2j(p* ~(a), p^^ ~{a.);w) is the actual transmission rate for CU i in the association profile a, and 
it can be expressed as follows: 

= P^(ptj(5);5) - P^(p*_,,^(5);5). (16) 
Combining ( fT3] ). ([T5] ) and ([T6l ) we must have that: 

^',5(p^(a);a)--P^(pli,^(a);a) < 

Pw{Pi,iS,P{n(^);^) - ^«;(p^;(a);a). (17) 

We notice that the term P^j {p*_- ~ (a) ; a) is equivalent to Pi^ {p*_- ~ (a) ; a) due to the equivalence of the 
following sets: 

{j : j ^ i,a{j) =w} = {j : j ^ i,a{j) = w}. (18) 

Recall that from Corollary [T] we have that the NE power allocation profile maximizes the potential 
function: p,~(a) G argmaXp^gjra i-'|j(p{5; a). Observe that the set of CUs associated with AP w under 
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profile a is the same as the set of CUs associated with AP a under profile a excluding CU i, we must 
have p*_^~(a.) G T-. Consequently, the following is true: 

P^5(pl(a);a) > P^(p*_,^~(S); a) P^(pl,^~(S); 5) (19) 

where (a) is from ([TSll . Similarly, we have that: 

P^5(pl(a);a) > ^^(pi,^;, pl(5); a). (20) 

Combining (O, and ([T7]l, we have that: 

Pa(p~(a);a) - Ps{plj{a);a) < Psi(p^(a);a) - Pffl(pji(a);a) 

which essentially says that after i switched to AP id, the decrease of EP of AP w is less than the increase 
of the EP of AP w. In other words, we have that: 

Pis{p*si^);a) + Ps{pUa);a) < Pffi(p.^(S); S) + Ps(pJi(S); S). (21) 

Noticing that the equilibrium potentials of all the APs other than w and id are the same between the 
profile a and a, thus adding them to both sides of (|2TI ) we have that: 

P^(p;(S);5) < P^(p;(a);a) (22) 

which is equivalent to: 

P(S) < P(a). (23) 

This is a contradiction to the assumption that -P(a) maximizes the system potential. We conclude that 
a must be a NE association profile. Clearly, p*(a) is a NE power allocation profile. Consequently, we 
have that (a, p*(a)) is a JEP ■ 

IV. The Proposed Algorithm 

In this section, we introduce our main algorithm, referred to as the Joint Access point Selection and 
Power Allocation (JASPA) algorithm, that allows the CUs in the network to distributely compute the 
JEP. To this end, we first introduce a simple scheme that assigns the CUs to their closest AP, a scheme 
which essentially separates the process of AP association and power allocation. This scheme, although 
relatively simple, offers valuable insights upon which we build the JASPA algorithm, in subsection IIV-BI 
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A. Closest AP Association Algorithms 

Consider a fixed AP association profile a in which each CU is assigned to its closest AP. The 
"closeness", or "distance" from a CU to the APs can be measured either by the physical distance between 
them, or by the strength of pilot/control signal received by the CU from the AP Assuming that each CU 
has a single closest AP 0, then the AP association profile is unique and the computation of JEP reduces 
to the problem of finding the NE power allocation profile. Moreover, as mentioned before, the CUs are 
partitioned into independent sets Mw — {i '■ a(0 = w}, and the CUs in each set AC can compute their 
NE power allocation profile without taking into consideration the behaviors of the CUs in other sets. 

Clearly, this scheme separates the process of spectrum decision and spectrum sharing, and the CUs 
only need to carry out the task of sharing the spectrum available to the designated AP with other CUs. 
However, as we probably can speculate, no matter how efficient such sharing scheme is, the overall 
system performance might suffer because of the fixed and inefficient AP assignment. We will see such 
performance degradation later in the simulation section. 

We note that from Proposition 2 and 3 in ||2], for a specific association profile a, the set of CUs Mw 
that is associates to the same AP w are able to distributedly decide on their NE power allocation profiles 
by running either the A-IWF or the S-IWF algorithm (cf. Algorithm 1 and Algorithm 2 in 111). 

B. The Joint AP Selection and Power Allocation Algorithm 

We name the proposed algorithm Joint Access Point Selection and Power Allocation (JASPA) algorithm. 
Intuitively, the proposed algorithm works as follows. For a fixed AP association profile, all CUs calculate 
iteratively their NE power allocations. After convergence, they individually try to see if they can strictly 
increase their communication rates by switching to another AP, assuming that all other CUs keep their 
current AP associations and power profiles. When CU i decides that its next best AP association should 
be w* , we record his decision by a x 1 best reply vector hi : bj = e^* , where denotes a. W x I 
elementary vector with all entries except for the j*'^ entry, which takes the value 1. In the next iteration, 
CU i's actual AP association decision is made according to a x 1 probability vector (3\, which is 
properly updated in each iteration according to bj. We also suppose that each CU has a length M memory, 
operated in a first in first out (FIFO) fashion, that records its last M best reply vectors. 

The proposed algorithm is detailed as follows. 

*This assumption is without loss of generality because if two APs have the same "distance" to a CU, they can be further 
ranked by other closeness criterion. 
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1) Initialization: Let t=0, CUs randomly choose their APs. 

2) Calculation of the NE Power Allocation Profile: Based on the current association a*, all the CUs 

calculate their NE power allocations p*(a*), either by A-IWF or S-IWF algorithm. We call the process 
of reaching such intermediate equilibrium an "inner loop". 

3) Selection of the Best AP Association: Each CU i talks to all the APs in the network, obtains 
necessary information in order to find a set of APs W* such that all w G W* satisfies lu ^ a*(i) and: 

max i?i(pi,^,p;(a*);u;) > iii(p*(a*),pli(a*);a*(i)). (24) 

If W* / 0, obtain the w* G Wj that can offer the maximum rate (ties are randomly broken); otherwise, 
let w* = a*(i). Set the best reply vector b-"^^ = e^*. 

4) Update Probability Vector: For each CU i, update the x 1 probability vector /3* according to: 

/3* + ig(b*+^-b*-'') if M<t 

/3* + 17(b*+'-bD ifM>t>0 (25) 
hj if t = 0. 



/3; 



t+1 



Shift b.^ into the end of the memory; shift b^" out from the front of the memory if t > M. 

5) Determine the Next AP Association: Each CU i samples the AP index for association at next 
iteration according to the probability P^^^, i.e., 

a^+^{i) ^multi{(3l+^) (26) 

where multi{.) represents a multinomial distribution. 

6) Continue: Let t=t+l, and go to Step 2). 

We make several comments regarding to the above JASPA algorithm. 

Remark 1: It is crucial that each CU finally decides on choosing a single AP for transmission. Failing 
to do so will result in system instability, in which the CUs switch AP association indefinitely, and much 
of the system resource will be wasted for closing old connections and re-establishing new connections 
between the APs and CUs. In another word, it is preferable that for all i G Af, limf_j.oo /3* = (3* and: 

{1 for a single w G W 
(27) 
otherwise. 

Remark 2: The best reply vectors {hf^^}i^j^f are decided in each iteration based on the other CUs' 
AP associations and power profiles in the previous iteration. It is straightforward to show that in order to 
calculate maxp. ^^gj-. ^ i2j(pj^, pj^(a*); w) for different w G W, individual CU i does not need to know 
the strategies of all other CUs in the network, nor does it need to know the system association profile 
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a*. Instead, it only requires the information of aggregated interference plus noise on each channel from 
each AP of the last iteration. This is precisely the necessary information needed for finding the set W* 
in Step 3) of the JASPA. This property of the algorithm contributes to the reduction of the amount of 
messages exchanged between APs and each CU when making association decisions. 

Remark 3: Considering the overhead regarding to end an old connection and re-establish a new 
connection, it is reasonable to assume that a selfish CU is unwilling to abandon its current AP if the new 
one cannot offer significant improvement of the data rate. We can model such unwillingness of the CUs 
by introducing a connection cost q > 0, which is a private parameter for each CU i. A CU i will only 
seek to switch to a new AP if the new one can offer rate improvement of at least Cj, i.e., it will only 
switch to those APs w G W* that satisfies: 

max Ri{Yii,w,Pl,{si)\w) > i?j(p*(a*), plj(a*); a*(i)) + q. 

From a system point of view, such unwillingness to switch by the CUs might contribute to improved 
convergence speed of the algorithm, but might also result in reduced system throughput. These two 
phenomenons are indeed observed in our simulations, please see section IVIII for examples. We note that 
the the equilibrium solution resulted from using the costs {ci}i^_^f is closely related to the notion of 
"e-equilibrium" in the game theory. See chapter 4 of ll25l for details. 

C. Proof of Convergence 

In this section, we prove that the JASPA algorithm converges to a JEP globally, i.e., the algorithm 
converges regardless of the initial starting points of the algorithm, or the realizations of the channel gains. 

We first introduce some notations. Let c* be a vector denoting the best reply association profile at 
time t, i.e., c*(i) = w if and only if b*(w) = 1. Define a set C and A as follows: c £ C 
c infinitely often in {c*}^^. and a. G A <J=> a infinitely often in {a^}^^. We first provide a propo- 
sition stating that there must exist a NE association profile a* that satisfies a* G A. The proof of this 
proposition can be found in Appendix |A] 

Proposition I: Choose M > N. Then at least one element in the set A, say a*, is a NE association 
profile. Moreover, (a* , p* (a* ) ) is a JEP ( satisfy equation ^ ). 

Using the result in Proposition [U we obtain the following convergence results. 

Theorem 2: When choosing M > N, the JASPA algorithm produces a sequence |(a*, p*(a*))|^j^ 
that converges to a JEP (a*,p*(a*)) with probability 1. 
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Proof: We first show that the sequence {a*}^^ converges to an equiUbrium profile a*. Notice that 
if at time T, a^ = a*, and in the next M iterations, we always have a^"*"* = a*, t = 1, • • • , M, then 
the algorithm converges. 

Let A* ^ A contains all the NE association profiles in A. Let {a*^'^) : A; > 1} be the infinite subsequence 
satisfying a*^'^) € A* . Without loss of generality, assume t{k) — t{k — 1) > M. Let us denote by Ck 
the event in which the process converges to a a* ^ A*, after a sequence of best replies equals to a* 
of length M occurs, starting at time Ck = nfii{a*^''^+' = a*}. Note, Pr{Ck+i\C^) > (jj)^''^, 
because whenever a* appears, each CU i's best reply should be a*{i), hence a*(i) will be inserted into 
the last slot of CU i's memory. Then with probability [jj]^ , all CUs sample the last memory and a* 
will appear in the next iteration. Thus, 

Pr( n C^=^^S'(f] C'^-^^J[[(^ - Pr{C,^i\C^)) 

\k>l J \k=l / k=l 

< lim =0. (28) 

This says Pr(a* converges to a a* G A* eventually) = 1. Finally, because p*(a*) G if (a*) is a NE 
power allocation profile, we conclude that (a*,p*(a*)) is a JEP. ■ 

We mention that the requirement on the length of the memory is technical in order to facilitate the 
proof. In simulations, we observe that such requirement is not necessary for ensuring convergence. 

Now that we have shown the convergence of the JASAP to the JEP, it is of interest to evaluate the 
"quality" of such network equilibrium. In this work, we use the system throughput to measure the quality 
of the JEP, and our simulation results (to be shown in section IVIII ) are very encouraging. 

V. Extensions to the JASPA Algorithm 

The JASPA algorithm presented in the previous section is "distributed" in the sense that the computation 
that each CU needs to carry out in each iteration only requires some local/summary information, i.e., the 
aggregated interference plus noise at different APs in different channels, and the CU's own channel gain. 
However, this algorithm requires that for each AP association profile a*, an intermediate equilibrium 
p*(a*) should be reached, and at each iteration t the CUs cannot choose their next AP association 
profile until the system reaches such equilibrium. This requirement poses a relatively strong level of 
coordination among the CUs (although this issue can be alleviated by letting the APs orchestrate the 
updating instances), which is not entirely desirable for a distributed algorithm. 

In this section, we propose two algorithms that do not require that the CUs reach any intermediate 
equilibria. Specifically, we propose 1) a sequential version of the JASPA algorithm (Se- JASPA) in which 
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CUs act one by one in each step, and 2) a simultaneous/parallel version of the JASPA algorithm (Si- 
JASPA) in which CUs act at the same time. 
The Se-JASPA algorithm is detailed in Table H 

1) Initialization (t=0): Each CU randomly chooses a''(i) and ^"(1) 

2) Determine the Next AP Association: 

If it is CU i's turn to act, (e.g., {{t + l)modeiV} + 1 = i), then CU i 
finds a set W* s.t.: 

W* = argmaxu,gw maxp^^gjr, ^„ R{Pi,w, Pw;w) 
Then it selects an AP by randomly picking w* G W* and setting a'+^(i) — w*. 
For other CUs j / i, a*+^(j) = a*(j) 

3) Update the Power Allocation: 

Denote w* — a^'^^{i). Then CU i calculates p*"*"^ as 

(argmaxp^^.6^^^^„. i?i(pi,u,. , p^. ; w*), if w* / a*(j) 
argmaxp.^. 6jr. Ri{pi,w* ,P-i,w*;w*), otherwise 
For other CUs j / i, p*+^ = p* 

4) Continue: Let t=t+l, and go to Step 2) 

TABLE I 
The Se-JASPA Algorithm 

We partially characterize the convergence behavior of Se-JASPA algorithm in the following theorem, 
the proof of which can be found in Appendix IB] 

Theorem 3: The sequence of system potential {P(p*,a*)}^^ produced by the Se-JASPA algorithm is 
non-decreasing and converging. 

Some brief comments regarding to the Se-JASPA algorithm is in order. We see that the Se-JASPA 
algorithm differs from the JASPA algorithm in several important ways. Firstly, a CU i does not need to 
keep its best reply vector b* as it does in JASPA. It decides on its AP association greedily in step 2). 
Secondly, a CU i, after deciding a new AP a^~^^{i) = w*, does not need to go through the process of 
reaching an intermediate equilibrium with all other CUs to obtain p*^l. However, the CUs still need 
to be coordinated for the exact sequence of their update, because in each iteration only a single CU 
is allowed to act. Such order of update can be agreed upon and enforced by the APs in the network. 
As might be inferred by the sequential nature of this algorithm, when the number of CUs is large, the 
convergence becomes slow. 

The Si-JASPA algorithm, as detailed in Table |lll overcomes the above difficulties encountered in Se- 
JASPA. We note that in the algorithm, the variable Tj represents the duration that CU i has stayed in the 
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1) Initialization (t=0): Each CU i randomly chooses a''(i) and p^^^ofi) 

2) Selection of the Best Reply Association: 

Each CU obtains the AP w* and set b'+^ following Step 3) of JASPA 

3) Update Probability Vector: 

Each CU i updates the probability vector /3' according to ( I25t 
Shift b*+^ into the memory; shift b*"^^ out of memory if t > M 

4) Determine the Next AP Association: 

Each CU i samples the AP index for association as in i26i 

5) Compute the Best Reply Power Allocation: 
Let wl'^^ — a*^^(i). Each CUs i calculates p* as 

p* = maxp^ Ri{p^^^t+i , pLi^^t+i ; 

6) Update the Duration of Stay: 

Each CU i maintains and updates a variable Ti: 




1 if a'+i(i) /a'(i) 

T. + l if a*+i(i) = a*(i) 



7) Update the Power Allocation: 

Each CU i calculates p*"*"^ as follows: 




TABLE II 



The Si- JASPA Algorithm 



current AP, and the stepsizes {at} is similarly defined as in the A-IWF algorithm in ||2]: Of € (0, 1) and 



T 



T 




(29) 



t=i 



t=i 



We see that the structure of the Si-JASPA is almost the same as the JASPA except that each CU, 



after switching to a new AP, does not need to go through the process of joint computation of the 



intermediate equilibrium solution. Instead, the CUs can make their AP decision "continuously". The level 



of coordination among the CUs required for this algorithm is minimum among all the three algorithms 



introduced so far. The simultaneous update required by this algorithm can be realized by either one of 



the following approaches: 



• The APs agree upon the update interval off-line. Each CU is equipped with a timer. The first time 



a CU comes into the system, it is informed by its initial associated AP the update interval and the 



next update instance. After that, this CU can perform update on its own. 



• The APs agree upon the update interval off-fine. When the time comes for the update, the APs 
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individually alert the CUs associated with them by broadcasting. 



Extensive simulations suggest that this algorithm converges faster than the Se-JASPA. 



VI. JASPA Based on Network-Wide Joint-Strategy 



The Se/Si-JASPA algorithms introduced in the previous section reliefs the CUs from the burden of 
reaching intermediate equilibrium. However, the lack of general proof of convergence for them might be 
a concern to us (although they appear to be always convergent in practice). In this section, an alternative 
algorithm with convergence guarantee is proposed. This algorithm allows the CUs, as in the Se/Si-JSPA, to 
jointly select their power profiles and AP association without the need to reach the intermediate equiUbria. 
We will see later that compared with all the algorithms introduced previously, the algorithm studied in 
this section requires considerably different information/memory structure for both the CUs and the APs. 
Among others, it requires that the CUs maintain in their memory some history of the network-wide joint 
strategy of all CUs. We henceforth name this algorithm Joint-strategy JASPA (J-JASPA). 

A. The J-JASPA Algorithm 

We first give some definitions. As all previously mentioned algorithms, the J-JASPA algorithm is 
iterative in nature, thus in the following we use t to denote the t*^ iteration of the algorithm, if needed. 

• Let J\f^ = |z : a*(i) = i«} be the set of CUs that are associated with AP w in iteration t. 

• Let Q be any subset of M. Define the last time that the subset Q of CUs is associated with a 
particular AP w as t«,(Q), i.e.. 



• Let I{Mw) be the joint interference profile by the subset of CUs Afw that is associated with AP w. 
i{^fw) = {h,w}i^j^^, where 1^,^ is defined in ©. 

As we mentioned before, one of the distinct feature of J-JASPA algorithm is the information/memory 
structure required for carrying out the computation. Specifically, each CU i keeps three different memories, 
each of which is of length M and operates in a FIFO fashion. The first memory, referred to as association 
memory (AM), records CU i's last M associated AP {a*(i)}^^_j^^^^^, i.e., AMi{m) = a'^~*^+™(z). 
Here we use AMi{m) to denote m^^ element in CU i's AM. The second memory, referred to as 
interference memory (IM), records the last M system interference levels for CU i, where 
!■ = {I*^}^g>v. The third memory, referred to as rate memory, records the last M CU i's sum rate. 



argmax^AC = Q} if U>o {K = Q} ^ ^ 



(30) 



oo 



otherwise. 
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Each AP w is also required to keep track of some local quantities Y\ regarding to the history of the 
CU behaviors. Specifically, AP w keeps track of the following variables for each subset Q'^ N that has 
been associated with AP w during time [0, T] at least once: 

• The local power profile p(Q) = Ip*™'^^^ \ 

• The local interference profile I(Q) = 

• The total number of times that Q has been played: T{Q) = J2t<T '^{■^w — Q}' where 1{.} is the 
indicator function. 

Then the J-JASPA algorithm can be detailed as follows: 

1) Initialization: Let t = 0, each CU i randomly chooses the a^{i) E W and e -7^i,a"(i)- 

2) Update CU Memory: For each i e M, talk to all AP in the system and obtain I*. Shift a*(i), I*, and 

(Pia*{j)'P-ia'(0'^*'^*'') ^"'^ °^ ™^ respectively. If t > M, shift the first 

element of the AM, IM and RM out of the memory. 

3) Update AP Memory: For each w e W, update the vectors: p(7V^) = |p-^„| ; l{J^i) = 



f«) + l iff(Ar4)y^OO 



■e-^™ 1 ifT{Mi) = oo 



4) Sample Memory: Let M = min{M, t}, each CU i uniformly samples its association memory: 

Sample index\ = multi(\-^, • • • , t^]^; 

^ M ^ Af ^ 

M elements 

Let a\ = AMi{index\), l\ = IMi{indexj), R\ = RMi{index\). (31) 

5) Calculate Best AP Association: Each CU i finds association according to a\, I- and R\, i.e., find the 
set of APs W* such that: 

W* = I : ^ max (p^,^ , I* „, ; u;) > ii* | U a* . (32) 
Then randomly pick w* G W*, and set a*+^(i) = w*. 

6) Calculate Power Allocation: Each CU i switches to AP a*+^(i). Let w = a*+^(z), then CU i obtain 
the following quantities from this AP w\ f{M^^), Ii(7V^+^), and pi(AA^+i). jf f (A(^+i) > 1 (the set 
of CUs Nw has been associated with AP w at the same time before), let a = af^j^t+iy and: 

p*+i = (1 - a)p,«+i) + a*, . (33) 



^Here, we use "local" to signify tlie fact that individual AP can gather these information without the need to communicate 
with other APs. 
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If T{Af^^) = oo (the set of CUs Mw has not been associated with AP w at the same time before), 

randomly pick p-"*"^ E -^i.a*+i(j)- 

7) Continue: Let t = t + 1, go to step 2). 

We see that although algorithmically the J-JASPA is similar to the previously introduced algorithms 
in the sense that the AP associations are decided probabilistically, and the power profiles are computed 
based on historical profiles and newly computed components there are several significant differences 
between the J-JASPA and the previously introduced algorithms. 

1) In J-JASPA, each CU calculates its best AP association according to a sampled historical network 
state, while in the JASPA and Si-JASPA, it calculates this quantity according to the current network state. 

2) In J-JASPA, CUs' AP association is the same as their best AP association, while in Si-JASPA and 
JASPA, their AP association is sampled from the memory. 

3) This algorithm requires APs to have memory. Each AP needs to record the local power allocation and 
interference profiles for all the different sets of CUs that have been associated with it in the previous 
iterations, while in the previously introduced algorithms, the APs do not need to have memory. 

4) The J-JASPA requires larger memory for the CUs for constructing AM, IM and RM. 

5) The J-JASPA requires extra communications between the CUs and the APs ( mainly in step 6). 

We will see in the next subsection that it is exactly these changes in the algorithm and the extra 
requirements in terms of memory and communication that enables the J-JASPA to have provable conver- 
gence guarantees without the need to reach the intermediate equilibria. This is a significant improvement 
compared with the original JASPA, which does need the CUs to reach the intermediate equilibria, and the 
Se/Si- JASPA, for which we are not able to provide complete convergence proofs. We have also observed 
in simulation that J-JASPA converges faster than Se/Si-JASPA. 

B. The convergence of the J-JASPA algorithm 

In this subsection, we show that the J-JASPA algorithm converges to a JEP. 

Define the set A as follows: a G ^ <;=^ a infinitely often in {a*}^j^. We first provide a proposition 
characterizing the power profiles of the CUs in the network every time a profile a G ^ appears. 

Proposition 2: Choose a. & A. Let be the subsequence of such that a is played, 

i.e., : = aj-. Then we have for all w G W, lim^^^Qo Pui^^ = P^(a), where p^(a) is a 

NE power allocation profile for AP w under a, i.e., p^(a) G Furthermore, we have that 

lim„_,ooP«, (p^i"^a*W) = P^(a) and lim„^oo P (p*("\ a*{")) = P(a). 
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Proof: For a € W, let Mw = {i : a(z) = t/;}. Define another subsequence in which the 

subset of CUs Mw is associated with AP w. Clearly, {t(n)}^i is a subsequence of From the J- 

JASPA algorithm, we see that at each t{n), ( [33] ) implements the single AP A-IWF (cf. Algorithm 1 in Ii2il ) 

{tin) 1 ^ 
J n=l 

converges to pJu(a) G f'«,(a), which is a NE power allocation profile under fixed system association 
profile a. Consequently, the infinite sub-subsequence IpJi"'' | also converges to the same p^(a). From 
Corollary [U we have that lim„^oo Pw (pT^ ] a*(") ) = -R„(a) and lim„_^oo P (p*^"^ , a*(")) = P(a). ■ 



We need the following set of definitions to proceed. Let a* be the sampled system profile at time t: 
a*(i) = a*, V i. For a specific a, define the subsequence {t(n,a)} be the time instances that a appears 
and is immediately sampled by all the CUs, i.e., |t(n,a) : a*^"''*) = a and a'^"'**^ = a}. Note that if 
a* = a, then according to step 5) of the J-JASPA algorithm, with positive probability a* = a. Thus, if 
a € A then {t(n,a)} is a infinite sequence. Define R* = maxp^^gjr. Ri ^p,^^, I* ^; tt;^ to 

be the maximum rate CU i can achieve in AP w based on sampled interference I* ^. Define a set: 

Bi (l*,a*(i)) ^ [w : R* (li^;w) > R^} \Ja\i) (34) 

where Rj is defined in (|3T] ). We call the set Bi ^I*,a*(i)^ CU i's best association set at time t. From 
step 5) of the J-JASPA algorithm, all w £ Bi ^I*,a*(i)^ has positive probability to be picked by CU 
i in iteration t + 1. Let i?j (I*(a), a(i)) = lim„^oo -Bi a*("''^)(i)^^ We provide a technical 

characterization of the best association set. See Appendix O for proof. 

Proposition 3: For a specific CU i and a system association profile a A, suppose there exists a 
w ^ a(i) such that w £ Bi (I*(a), a(i)), i.e., CU i has the incentive to move to a different AP in the 
limit. Then there exists a large enough constant N*{a) such that for all n > N*{a), we have: 

B, (i:(a),a(i)) C B, (ll("''^\a*("'=^)(i)) . (35) 



In words, this propositions says that suppose a specific association profile a happens infinitely often 
(hence is sampled infinitely often), and suppose in the limit, when a is sampled, a CU i prefers a.w^ a(i), 
then after iteration t{N*{a), a), it must prefer w in every time instance i(n, a), where n > Nl{a). Now 
we are ready to provide the main result for the J-JASPA algorithm. 

Theorem 4: The J-JASPA algorithm converges to a JEP with probability 1. 

Proof: Consider the sequence {(a*, p*)}^j^. Choose a to be any system association profile that 
satisfies the following: a € argmaxae^{^(a)}. We first show that (a, p*(a)) is a JEP. 

^Note that such limit exist because of Proposition |2] 
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Suppose (a, p*(a)) is not a JEP, then there exists a CU i, and aw ^ a{i) such that w ^I?(a), a(z)^ . 
This implies that there exists an e > such that: 



4 - Rl (%(,)(S);5(z)j > e. (36) 

Define a new association profile a as: 

a.(j) for j 7^ i 
u; J ^^^^ 

i(; for j = i. 

Following the steps we already went through in Theorem [T] from ([T2l ) to (1231 ). we can show that: 

P(5) < P(a). (38) 

It is clear that if a G then the above is a contradiction to the assumption that a G arg maXaG^{P(a)}. 
In the following, we show that a. ^ A, thus completing the proof. 

From Proposition |3l there exists a A'^-*(a) large enough that for all n > iVf (a), w G B- (^i!("'^),a*("''^)(i; 
Take any n > N-*(sl). We know that from the definition, in iteration t(n,a), a*("''*)(i) = a, V i G M. 
From Step 5) in the J-JASPA algorithm, we see that with positive probability, in iteration a) + 1, CU 
j ^ i chooses to stay in a(j), and CU i chooses to switch to w. This implies that the association profile 
a happens with positive probability in every time instance t(n,a) + 1. Because {t(?i,a)} is a infinite 
sequence, a happens infinitely often, i.e., a G ^. 

In summary, we conclude that a must be a NE association profile, and thus, (a, p*(a)) is a JEP. 

Finally, following the proofs of Theorem |2] we can show similarly that the sequence {(a*, p(a*))}^j^ 
that produced by J-JASPA converges to a JEP with probability 1 . ■ 

VII. Simulation Results 

In this section, we present various simulation results to validate the proposed algorithms. We first show 
the results regarding to the convergence properties, and then present the results regarding to the system 
throughput performance. Due to the space limit, for each experiment we show the results obtained by 
running either Si/Se-JASPA and J-JASPA, or the results obtained by the original JASPA. 

We have the following general settings for the simulation. We place multiple CUs and APs randomly 
in a 10m x 10m area; we let denote the distance between CU i and AP w, then the channel gains 
between CU i and AP w, {|/ij,ui(^)P}fcGAC,„' independently drawn from an exponential distribution 
with mean (i.e., is assumed to have Rayleigh distribution). We let the available channels 

to be evenly pre-assigned to different APs. When we say a "snapshot" of the network, we refer to the 
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network with fixed (but randomly generated as above) AP, CU locations and channel gains. We set the 
length of the individual memory as M = 10. For ease of presentation and comparison, when we use the 
JASPA algorithm with connection cost, we let all the CUs' connection cost {cj}ig^ be identical. 

A. Convergence 

We only show the results for Si/Se-JASPA and J- JASPA in this subsection. We first consider a network 
with 20 CUs, 64 channels, and 4 APs. Fig. [J shows the evolution of the system throughput as well 
as the values of the system potential function generated by a typical run of the Se-JASPA, Si-JASPA, 
J-JASPA and Si-JASPA with connection cost q = 3 bit/sec V i € A/". We observe that the Si-JASPA with 
connection cost converges faster than Si-JASPA and Se-JASPA, while Se-JASPA converges very slowly. 
After convergence, the system throughput achieved by Si-JASPA with connection cost is smaller than 
that of the other three algorithms. Notice that in the right part of Fig. [T] the system potential generated 
by the Se-JASPA is non-decreasing along iterations. This property has been identified in Theorem |3] 




Fig. 1. Comparison of convergence speed by different algoritlims. Left: evolution of system sum rate. Riglit: evolution of the 
value of system potential function. 

Fig. E] shows the evolution of the AP selections made by the CUs in the network during a typical run of 
the Si-JASPA algorithm. We only show 3 out of 20 CUs (we refer the selected CUs as CU 1, 2, 3 for easy 
reference) in order not to make the figure overly crowded. Fig |2] shows the corresponding evolution of the 
probability vectors {/3-}^£'{ for the three of the CUs selected in Fig. |3] It is clear that upon convergence, 
all the probability vector converges to a 0-1 vector. We then evaluate how the number of CUs in the 
network affects the speed of convergence of different algorithms. In order to do so, we compare the 
average iterations to achieve convergence in the network with 4 APs, 64 channels and different number 
of CUs, for the following three algorithms 1) Si-JASPA, 2) Se-JASPA, 3) J-JASPA, 4) Si-JASPA with 



24 




Iteralions Iterations Iterations 



Fig. 2. Convergence of the probability vector j3l, P2 and of CUs' 1, 2, 3. 

connection cost Cj = 3 bit/sec for all CUs. From FigJH we see that when the number of CUs in the 
system becomes large, the sequential version of the JASPA takes significantly longer time to converge 
than the other three simultaneous versions of the JASPA algorithm. Moreover, the J- JASPA shows faster 
convergence than the Si/Se- JASPA. We can also see that the connection costs adopted by individual CUs 
indeed have positive effects on the convergence speed of the system. 




Fig. 3. Convergence of Selected CUs' AP selection. Fig. 4. Comparison of averaged convergence speeds. 

Note that each point in this figure represents the average of 1 00 independent runs of each algorithm 
on randomly generated network snapshots. 

B. System Throughput Performance 

We then evaluate the network throughput performance achievable by the JEP computed by the JASPA. 

We first investigate a small networks with 8 CUs, 64 channels and 1, 2, 3, 4 APs, and compare 
the performance of JASPA related algorithms to the maximum network throughput that can be achieved 
for the same network. The maximum network throughput for a snapshot of the network is calculated 
by the following two steps: 1) for a specific AP-CU association profile, say a, calculate the maximum 
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network throughput (denoted by T(a)) by summing up the maximum capacitjf] of individual APs in the 
network; 2) enumerate all possible AP-CU association profiles, and find T* = maxaT(a). It is clear 
now that the reason we choose to focus on such relatively small networks in this experiment is that for a 
large network, the time it takes for the above exhaustive search procedure to find the maximum network 
throughput becomes prohibitive. 

The result is shown in FigJS) where each point on the figure is obtained by running the algorithms on 
100 independent snapshots of the network. We see that the JASPA algorithm performs very well with 
little throughput loss, while the closest AP algorithm, which separates the tasks of spectrum decision and 
spectrum sharing, performs poorly. 

We then start to look at the performance of larger networks with 30 CUs, up to 16 APs and up to 
128 channels. Fig. |6] shows the comparison of the performance of JASPA, JASPA with individual cost 
Cj = 3 bit/sec and q = 5 bit/sec, and the closest AP algorithm mentioned in section IIV-AI We adopt the 
actual distance as the measure of "closeness" in the closest AP algorithm. Each point in this figure is 
the average of 100 independent runs of the algorithms. 




Number of APs Number of APs 

Fig. 5. Comparison of the system throughput by different Fig. 6. Comparison of the system throughput 
algorithms with the throughput upper bound in a 8 CU network, in a 30 CU network. 

Due to the prohibitive computation time required, we are unable to obtain the maximum system 
throughput for these relatively large networks. We instead compute the equilibrium system throughput 
that can be achieved in a game if all CUs are able to connect to multiple APs at the same time. We refer 
to this as the multiple-connectivity network. It is clear that in such network, there is no need for the CUs 

'For a single AP with fixed number of users and channel gains, the maximum capacity is the well-known multiple access 
channel sum capacity. 
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to perform the AP selection, and the CUs in this network enjoy the flexibiUty of being able to connect 
to multiple APs at the same time. However, we observe that the performance of JASPA is close to that 
of the "multiple-connectivity" network. 

From Fig. |6] we see that when the number of APs increases, the throughput of the JASPA algorithm 
becomes much better than the closest AP algorithm, a phenomenon that is partly due to the fact that 
for the closest AP algorithm, the separation of the AP selection and power allocation process results in 
the insufficient use of the spectrum: when the number of AP increases, it becomes increasingly more 
probable that several APs are idle because no CUs are close to them. Fig. [6l along with Fig. |4l also serve 
to confirm our early speculation that algorithms with connection cost can indeed improve the convergence 
speed while reducing the system throughput. 

We also observe from Figj6] that generally the system throughput increases as the number of APs 
increases, which suggests that the scheme that partitions the available spectrum and assigns them to the 
APs with different geographical locations is indeed more favorable than the scheme which uses a single 
AP to manage all the spectrum. This phenomenon can be explained partly by reasoning as follows: when 
using a single AP, it is likely that many of the CUs are located far away from the AP, and thus none of 
their channels have good quality; on the other hand, when using multiple APs, although each CU can 
only use part of the available channels, it is more probable that there is one or more APs that are located 
in its vicinity, thus is able to provide good channel quality. We have to mention here that, although not 
shown in Figl6l placing too many APs in the network may also result in reduced network throughput as 
the chance of idle APs increases as the number of AP increases (an extreme case is that the number of 
APs is larger than the number of CUs). 

VIII. Conclusion 

In this paper, we addressed the joint AP association and power allocation problem in a CRN. We 
formulate the problem into a non-cooperative game with hybrid strategy space. We characterized the NE 
of this game, and provided distributed algorithms to reach such equilibrium. Empirical evidence gathered 
from simulation experiments suggests that the equilibrium has very promising quality in term of the 
system throughput. 

There can be many future extensions to this work. First of all, the non-cooperative game with hybrid 
strategy space analyzed in this paper can be applied to many other problems as well, for example, the 
CRN with interference channel and segmented spectrum mentioned at the end of section II-AI Secondly, 
for the problem considered in this work, it is beneficial to characterize quantitatively the efficiency of 
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the JEP, and to provide solutions for efficiency improvement. Thirdly, it will be interesting to analyze 
the effect of time-varying channel gains and the arrival and departure of the CUs on the performance of 
the algorithm, and to propose suitable heuristic dealing with these situations. 

Appendix A 
Proof of Proposition [T] 

Proof: Choose c G C and suppose that at time t, c* = c. If c is an equilibrium association profile, 
then with probability at least (jj)'^ (all CUs chooses a*(i) = c*(f)), we have that a* is an equilibrium 
association profile. Because c* = c happens infinitely often, we must have a* = c happens infinitely 
often, i.e., there exists an equilibrium association in the set A. 

Suppose c is not an equilibrium association. Then consider the following steps of operation. 

Step 1): With probabihty at least (jj)^ , a* = c, and c e A. Because c is not a NE, then without loss 
of generality, assume that CU i is better off by switching to Wi'. c*+^(i) = Wi ^ c*(i). Then we must have 
that P(a*) < P (({uj, a^-)), a fact from (1231 ). Then with probability at least (jj)^ (all player j except 
player i choose a^~^^{j) = c*(j) = a*(j)) that a*"^-"^ = {'Wi,a^_-), and we have P(a*+^) > -P(a*). Put 
index i in the the setU : U = {i}. We note in this stage, we have:a*+^(i) = c*+^(i). Similarly, at t + 2, 
if we are able to find a CU j ^ i with c*+^(j) = idj / a*+^(j) (i.e. CU j is better off if switching to AP 
Wj), we let U = Then again with positive probability, we have a*+^ = {wj, sl^^j'). Consequently, 

P(a*"'~^) > P(a*+^). We note in this stage, the following is true: a*"'~^(j) = c*"'~^(j). Continue this 
process, until we reach a time t + 7i <= t + N such that only CUs in the set U are willing to switch. 
Let £ be the complement set of U. 

Step 2): We must have that for j G £, a*+"~^(j) = c*+"(j). On the other hand, for all i eU, from 
the argument in Step 1), we see that there must exist a < ki < n such that c*+^'(z) = a*+"~^(i). Pick 
q £ U such that q = argmirijg^ c*+"(i)7^a'+"-i(j) Consequently, we can shift c* out of the memory 
and still be able to construct a*+" = (-Wg, a*jg"~^) with positive probability, because all the elements in 
a*^"~^ must have been appeared once in {c*}^j|"_|_i- Move q out of U and into £, and continue Step 2) 
until only CUs in the set £ are willing to switch. Switch the role of U and £, and continue Step 2). 

By continuously performing the above operations, {P(a*)}^^ is a strictly increasing sequence, and 
there must exist a finite time instance T < oo such that it is not possible to find a*+^+^ that differs 
with a*+^ with a single element and has the property P(a*+^+^) > P(a*"'"^). Consequently, a*+^ is an 
equilibrium profile. We let a* = a'+^.The finiteness of T comes from the finiteness of the number of 
values of -P(a) (due to the finiteness of the choice of a). Such finiteness combined with the strict positivity 
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of the probability of performing each operation in Step 1) and Step 2) impUes that the probabiUty of 
reaching a* from c is non-zero. 

We conclude from the above analysis that with positive probability, a NE profile a* will appear after 
a* in finite steps. Because a* = c happens infinitely often, we must also have that a* happens infinitely 
often, i.e., a* G A. 

Finally, it is straightforward to see that the fact that a* is an equihbrium association profile suggests 
that the tuple (a*,p*(a*)) is a JEP. ■ 

Appendix B 
Proof of Theorem [3] 

Proof: Suppose that at time t + 1, it is CU i's turn to move. Let w = a*(«) be the CU i's associated 
AP at time t. We have the following two situations. 

1) At time t + 1, CU i is best off switching to w ^ In this situation, CU i's communication 

rate at time t under association profile a* is as follows: 

P^(pL;a*) - P»(p*_,,„;a*+i) P„(pL;a*) - P„(pL+i; a*+i) (39) 

where (a) and (6) are true because of the fact that at time t + 1, CU i no longer associates with AP w, 
and all other CUs keep their power and association profile the same as in time t. 
CU i's communication rate after it finishes switching to {5 is: 

= Pffi(p*+\p*5;a*+i) - Pffi(pt;;a*) = Pffi(p*+i; a*+i) - Pffi(pt;; a*). (40) 
Because Ri{^\+} ,^%]w) > i?i(p* p*.^ w), we have: 

Pfi;(p*+i;a*+i) + P„,(pL+^a*+i) > P^(p*^; a*) + Pffi(p*,; a*). 

Arguing similarly as in (|22] |. we have that SEP must satisfy: P(p*"^^; a*+^) > P(p*;a*). 

2) At Time t + 1, CU i stays in w. Notice that in this case, we have a*+^ = a*. From Proposition 3 
of HI, we have the following inequality:P^(p*+\ p*+^„; a*+i) = P^(p*+\ p*_,^; a*) > P^(p*^, p*_,^; a*), 
thus, P(p*+^; a*"*"^) > P(p*; a*). We conclude that in both cases, the system potential is non-decreasing. 
Because P(p*;a*) is upper bounded, |P(p*;a*)}^^ is a converging sequence. ■ 
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Appendix C 
Proof of Proposition [3] 
Proof: From Proposition |2l if a G we have that lim„_).oo p^"'**^ = (a) ) V i G M, which 
implies that lim„_j,oo I*^^'*^^ = «,(^))^ ^ ^ This result combined with the continuity of the function 
R* (fi^^^;w) with respect to I^J^'''^ and the continuity of the function = Ri (^pj^"'''^ a*("'^)(i)) 

with respect to pj^"'^\ further implies that, for any 5 > 0, there must be a N{5) such that for all n > N{6), 
the folio wings are true: 

max R* [^%^'^^'^^) - -R* (!*.«, (a); w) < S, 

^f"''^)-i?.(p*(a);a(*))| <<5. (41) 

For any w ^ such that w € Bi (I* (a), a), there must exits a e^j > such that: 

R* (llMw) - R. (l*,(,)(a);a(z)) > 6„,. (42) 

Take e > such that e = min^g^.^j.^a) ^(j)) ^^id choose a 6 small enough such that < 26 < e, and 
let N*{ai) = N{6). We have that for all n > N*{a), the following is true: 



i?,r(I*^(a);u;)-i?,(^I*,(,)(a);a(z)^ 
= R, (l*Ja); ^) + i?* (l*!:^^); u;) - ^ (l^^^'^^ + i?f "'^^ - i?f "'^^ - i?. (lL(.)(a); a(*)) 



K(l*„(a);^)-i?*(l*;r);u;) + i?f "■'^^ - i?, ( (a); a(z) 



Consequently, we have that for all n > iV*(a), R* fl^f - > e - 2?> 0, which implies 



that w must be in the set Bi ( I*^"''^^ a*^"''*)(i) ) . The claim is proved. 
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